Approximation Algorithms for Submodular Multiway Partition 

Chandra Chekuri * Alina Ene ^ 

January 20, 2013 



Abstract 

We study algorithms for the SUBMODULAR MULTIWAY PARTITION problem (SUB-MP). An instance 
of SUB-MP consists of a finite ground set V, a subset of k elements S — {si, S2, ■ • ■ , Sk} called terminals, 
and a non-negative submodular set function / : 2^ — > R+ on V provided as a value oracle. The goal is to 
partition V into k sets Ai, . . . ,Ak such that for 1 < i < k, G Ai and J2i=i fi^i) is minimized. SUB-MP 
generalizes some well-known problems such as the MULTIWAY Cut problem in graphs and hypergraphs, 
and the NODE-WEIGHED MULTIWAY Cut problem in graphs. SUB-MP for arbitrary submodular func- 
tions (instead of just symmetric functions) was considered by Zhao, Nagamochi and Ibaraki [jZSl . Previous 
algorithms were based on greedy splitting and divide and conquer strategies. In very recent work |@] we 
proposed a convex-programming relaxation for SUB-MP based on the Lovasz-extension of a submodular 
function and showed its applicability for some special cases. In this paper we obtain the following results 
for arbitrary submodular functions via this relaxation. 

• A 2 -approximation for SUB-MP. This improves the {k — l)-approximation from ||25| . 

• A (1.5 — l/A:)-approximation for SUB-MP when / is symmetric. This improves the 2(1 — 1/k)- 
approximation from E0il25l . 
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1 Introduction 



In this paper we consider the approximability of the following problem. 

SUBMODULAR MULTIWAY PARTITION (SuB-MP). Let / : 2^ R+ be a non-negative submodular set 
functioi|3 over V and let S = {si, S2, ■ ■ ■ , Sk} be a set of k terminals from V. The submodular multiway 
partition problem is to find a partition of V into Ai, . . . ,Ak such that s-i G Ai and X]i=i fi^i) is minimized. 
An important special case is when / is symmetric and we refer to it as S YM-SUB-MP. 

Motivation and Related Problems: We are motivated to consider Sub -MP for two reasons. First, SUB-MP 
generalizes several problems that have been well-studied. We discuss them now. Perhaps the most well-known 
of the special cases is the MULTIWAY Cut problem in graphs (Graph-MC): the input is an undirected edge- 
weighted graph G = {V, E) and the goal is remove a minimum weight set of edges to separate a given set 
of k terminals Q. Although the goal is to remove edges, one can see this as a partition problem, and in fact 
as a special case of Sym-Sub-MP with the cut-capacity function as /. Graph-MC is NP-hard and APX- 
hard to approximate even for = 3 Q. One obtains two interesting and related problems if one generalizes 
Graph-MC to hypergraphs. Let G = {V, 8) be an edge-weighted hypergraph. 

Hypergraph-MC is the problem where the goal is to remove a minimum-weight set of hyperedges to 
disconnect the given set of terminals. Hypergraph Multiway Partition problem (Hypergraph-MP) 
is the special case of Sym-Sub-MP where / is the hypergraph-cut function: f{A) = X^ee<5(A) w{e) where 
w{e) is the weight of e and 5{A) is the set of all hyperedges that intersect A but are not contained in A. The 
distinction between Hypergraph-MC and Hypergraph-MP is that in the former a hyperedge incurs a cost 
only once if the vertices in it ai^e split across terminals while in Hypergraph-MP the cost paid by a hyperedge 
is the number of non-trivial pieces it is partitioned into. Both problems have several applications, in particular 
for circuit partitioning problems in VLSI design We wish to draw special attention to Hypergraph-MC 
since it is approximation equivalent to the NODE-WEIGHTED MULTIWAY Cut problem in graphs (NODE-WT- 
MC) where the nodes have weights and the goal is to remove a minimum-weight subset of nodes to disconnect a 
given set of terminals HoKIll. An important motivation to consider SuB-MP is that Hypergraph-MC can be 
cast as a special case of it |[25l ; the reduction is simple, yet interesting, and we stress that the resulting function 
/ is not necessarily symmetric. Since NODE-WT-MC is approximation-equivalent to Hypergraph-MC and 
Hypergraph-MC is a special case of Sub -MP it follows that one can view Node-wt-MC indirectly as a 
partition problem with an appropriate submodular function. We believe this is a useful obsei-vation that should 
be more widely-known. In fact, SuB-MP (and related generalizations) were introduced by Zhao, Nagamochi 
and Ibaraki ll25l partly motivated by the applications to hypergraph cut and partition problems. 

A second important motivation to consider SuB-MP and Sym-Sub-MP is the following question. To 
what extent do current algorithms and techniques for important special cases such as Graph-MC and NODE- 
WT-MC depend on the fact that the underlying structure is a graph (or a hypergraph)? Or is it the case that 
submodularity of the cut function the key underlying phenomenon? For Graph-MC Dahlhaus et al. Q gave a 
simple 2(1 — 1/ A;) -approximation via the isolating cut heuristic. Queyranne [20] showed that this same bound 
can be achieved for Sym-Sub-MP (see also I25J). For Graph-MC, Calinescu, Karloff and Rabani El, in a 
breakthrough, obtained a 1.5 — 1/k approximation via an interesting geometric relaxation. The integrality gap 
for this relaxation has been subsequently improved to 1.3438 by Karger et al. |[T5l . Once again it is natural to 
ask if this geometric relaxation is specific to graphs and whether corresponding results exist for S YM-SUB-MP. 
Further, the current best approximation for SuB-MP is (A; — 1) ll25l and is obtained via a simple greedy splitting 
algorithm. SuB-MP generalizes NODE-WT-MC and the latter has a 2(1 - 1/A;) approximation |[TT1 but it is a 
non-trivial LP relaxation based algorithm. Therefore it is reasonable to expect that one needs a mathematical 
programming relaxation to obtain a constant factor approximation for SUB-MP. 

'a set function / : 2^' ^ R is submodular iff f{A) + f{B) > f{A nB) + f{A U B) for all A,B CV. Moreover, / is symmetric 
if fiA) = f{V - A) for all ACV. 
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In very recent work O we developed a simple and straightforward convex-programming relaxation for 
SUB-MP via the Lovasz-extension of a submodular function (we discuss this in more detail below). An in- 
teresting observation is that this specializes to the CKR-relaxation when we consider Graph-MC ! A natural 
question, that was raised in is whether the convex relaxation can be used to obtain a better than 2 approxima- 
tion for Sym-Sub-MP. In this paper we answer this question in the positive and also obtain a 2-approximation 
for SuB-MP improving the known {k — 1) -approximation f25\. We now describe the convex relaxation. 

A convex relaxation via the Lovasz extension. The relaxation SubMP-Rel for SuB-MP was introduced in 
H. It is based on the well-known Lovasz extension of a submodular function which we describe for complete- 
ness. Let y be a finite ground set of size n, and let / : 2^ — )• M be a real-valued set function. By representing a 
set by its characteristic vector, we can think of / as a function that assigns a value to each vertex of the boolean 
hypercube {0, 1}". The Lovasz extension / extends / to all of [0, 1]" and is defined as follow^: 



/(x) 



E 
0e[o,i] 



where x^ G {0, 1}" for a given vector x G [0, 1]" is defined as: = 1 if Xi > 6 and otherwise. Lovasz 
showed that / is convex if and only if / is submodular lITSl . 

Sub -MP is defined as a partition problem. We can equivalently interpret it as an allocation problem (also 
a labeling problem) where for each f G F we decide which of the k terminals it is allocated to. Thus we 
have non-negative variables x{v, i) for u G F and I < i < k and an allocation is implied by the simple 
constraint Yli=i ^(^) i) = I for each v. Of course a terminal Si is allocated to itself. The only complexity is 
in the objective function since / is submodulai\ However, the Lovasz extension gives a direct and simple way 
to express the objective function. Let Xj be the vector obtained by restricting x to the i'th terminal s^; that is 
Xj = {x{vi,i), . . . , x{vn, i)). If Xi is integral /(xj) = f{A,i) where Ai is the support of x^. Therefore we obtain 
the following relaxation. 
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The above relaxation can be solved in polynomial tim^ via the ellipsoid method ||4l. We give algorithms to 
round an optimum fractional solution to SubMP-Rel and obtain the following two results which also establish 
corresponding upper bounds on the integraUty gap of SubMP-Rel. 

Theorem 1.1. There is a (1.5 — 1/k) -approximation for Sym-Sub-MP. 

Theorem 1.2. There is a 2-approximation for SUB-MP. 

Remark 1.3. It is shown in fTTl that an a-approximation for NODE-WT-MC implies an a-approximation for 
the Vertex Cover problem. Therefore, improving the 2-approximation for SuB-MP is infeasible without a 

^The standard definition is slightly different but is equivalent to the one we give; see 1231 , l4l . 
^The running time is polynomial in n and log [maxscv .f{S))- 
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corresponding improvement for Vertex Cover. It is easy to show that the integrality gap of SubMP-Rel is 
is at least 2(1 — \/k) even for instances of Hypergraph-MC. The best lower bound on the integrality gap of 
SubMP-Rel for Sym-Sub-MP that we know is 8/(7 + the same as that for Graph-MC shown in HI. 

Remark 1.4. Related to SuB-MP and Sym-Sub-MP are /c-way partition problems k-way-Sub-MP and 
k-WAY-Sym-Sub-MP where no terminals are specified but the goal is to partition V into k non-empty sets 
Ai, . . . , to minimize Yl!i=i fi-^i)- When k is pait of the input these problems are NP-Hard but for fixed 
k, k-WAY-Sym-Sub-MP admits a polynomial time algorithm llQl while the status of K-WAY-SUB-MP is still 
open. For fixed k one can reduce K-WAY-SUB-MP to SUB-MP by guessing k terminals and this leads to a 
2-approximation via Theorem II. 21 improving the previously known ratio of (/c + 1 — 2y/k — 1) |[T9l . 

Our results build on some basic insights that were outlined in H where the special cases of Hyper GRAPH- 
MP and Hypergraph-MC were considered (among other results). In |@] a (1.5 — 1/ A;) -approximation for 
Hypergraph-MP and a min{2(l - 1/fc), i/A}-approximation for Hypergraph-MC were given where A 
is the maximum hyperedge degree and Hi is the i'th harmonic number Our contribution in this paper is a non- 
trivial, and technical new result on rounding SubMP-Rel (Theorem [T3] below) that applies to an aibitrary 
submodular function. The. fonnuJation of the statement of the theorem may appear natural in retrospect but was 
a significant part of the difficulty. We now give an overview of the rounding algorithm(s) and the new result. 
We then discuss and compai^e to prior work. 

1.1 Overview of rounding algorithms and the main technical result 

Let X be a fractional allocation and Yli f{^i) the con^esponding objective function value. How do we round x 
to an integral allocation while approximately presei^ving the convex objective function? The simple insight in 
||4l is that we simply follow the definition of the Lovasz function and do 6-rounding: pick a (random) threshold 
9 G [0, 1] and set x{vj,i) = 1 if and only if x{vj,i) > 6. Let x{9) be the resulting integer vector If we pick 6 
unifomily at random in [0, 1] then the expected cost of J2i = J2i fi'^i)- However, the problem is 

that x(^) may not correspond to a feasible allocation. Let A{i, 9) be the support of Xj, that is, the set of vertices 
assigned to Si for a given 9. The reason that x(6') may not be a feasible allocation is two-fold. First, a vertex v 
may be assigned to multiple terminals, that is, the sets A{i, 9) for i = I, . . . ,k may not be disjoint. Second, the 
vertices U{6) = V- U^^^Aii, 0) are unallocated. We let A{6) = ujLi^(i, 6) be the allocated set. 

Our fundamental insight here is that the expected cost of the unallocated set, that is f{U{9)), can be upper 
bounded effectively. We can then assign the set U{9) to an ai^bitrary terminal and use sub-additivity of / (since 
it is submodular and non-negative). Before we formalize this, we discuss how to overcome the overlap in the 
sets A{i, 9). If / is symmetric then it is also posi-modular and one can do a simple uncrossing of the sets to 
make them disjoint without increasing the cost. If / is not symmetric we cannot resort to this trick; in this 
case we ensure that the sets A{i, 9) are disjoint by picking uniformly in (1/2, 1] rather than [0, 1] (we call 
this half -rounding). Now the unallocated set and the expected cost of the initial allocation are some what more 
complex. We analyze both these scenarios using the following theorem which is our main result. The theorem 
below has a parameter 6 G (1/2, 1] and this corresponds to rounding where we pick 9 uniformly from the 
interval (1 — (5, 1]. 



Theorem 1.5. Let x be a feasible solution to SubMP-Rel. For 9 G [0, 1] let A{i,9) = {v \ x{v,i) > 9}, 
A{9) = u'y^-^A{i,9) and U{9) = V - A{9). For any 6 G [1/2, 1], we have 




By setting 5 = 1, we get the following corollary. 
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Corollary 1.6. 

E/>^) = E f{MhO)d9> f f{A{e))de+ f f{u{e))d9. 

i=i i=i-^o Jo Jo 

Theorem 1 1 . 5 1 gives a unified analysis of our algorithms for Sym-Sub-MP and SUB-MP. More precisely, we 
get Theorem [TTT] and Theorem 1 1.2l as rather simple corollaries. Corollary 1 1.6l is sufficient to show that the S YM- 
SUB-MP algorithm achieves a 1.5-approximation and that the SuB-MP algorithm achieves a 4-approximation. 
In order to show that the SuB-MP algorithm achieves a 2-approximation, we need the stronger statement of 
Theorem II .5 1 

1.2 Discussion and other related work 

Our recent work ||4l| considered the Minimum Submodular-Cost Allocation (MSCA); SuB-MP is a special 
case. MSCA also contains as special cases other problems such as uniform metric labeling, non-metric facility 
location, hub location and variants. The main insight in O is that a convex programming relaxation via the 
Lovasz extension follows naturally for MSCA and hence 6'-rounding based algorithms provide a unified way 
to understand and extend several previous results. The integrality gap of SubMP-Rel for Sym-Sub-MP 
and Sub -MP were posed as open questions following results for the special cases of Hypergraph-MC and 
Hypergraph-MP. These results subsequently inspired the formulation of Theorem 1 1.5 1 

Geometry plays a key role in the formulation, rounding and analysis of the relaxation proposed for Graph- 
MC by Calinescu, Karloff and Rabani f3'|; they obtained a 1.5 — l/Zc approximation. The subsequent work of 
Karger et al. exploits the geometric aspects further to obtain an improvement in the ratio to 1.3438. If one views 
Graph-MC as a special case of Sym-Sub-MP then the function / under consideration is the cut function. 
The cut function / can be decomposed into several simple submodular functions, corresponding to the edges, 
each of which depends only on two vertices. This allows one to focus on the probability that an edge is cut 
in the rounding process. Our work in ||4l for Hypergraph-MP and Hypergraph-MC is also in a similar 
vein since one can visualize and analyze the simple functions that arise from the hypergraph cut function. Our 
current analysis differs substantially in that we no longer have a local handle on /, and hence the need for 
Theorem 1 1.5 1 It is interesting that the integrality gap of SubMP-Rel is at most 1.5 — 1/k for any symmetric 
function /, matching the bound achieved by |3J for Graph-MC. Our rounding differs from that in |]3]; both 
do ^-rounding but our algorithm uncrosses the sets A{i, 9) to make them disjoint while CKR-rounding does 
it by picking a random permutation. One can understand the random permutation as an oblivious uncrossing 
operation that is particularly suited for submodular functions that depend on only two variables (in this case the 
edges); it is unclear whether this is suitable for arbitrary symmetric functions. 

As weremai-ked, Sym-Sub-MP and SuB-MP were considered in several papers ||20l|25][T9] with HYPERGRAPH- 
MC and k-WAY-Hypergrap-Cut as interesting apphcations for SuB-MP. These papers primarily relied on 
greedy methods. It was noted in ||25l that Hypergraph-MC and NODE-WT-MC are essentially equivalent 
problems. Garg, Vazirani and Yannakakis |[II] gave a 2(1 — l/A;)-approximation for NODE-WT-MC ifTTl via a 
natural distance based LP relaxation; we note that this result is non-trivial and relies on proving the existence 
of a half-integral optimum fractional solution. Viewing NODE-WT-MC as equivalent to Hypergraph-MC 
allows one to reduce it to SuB-MP, and as we noted in H SubMP-Rel gives a new and strictly stronger 
relaxation for NODE-WT-MC. The previous best approximation for SuB-MP was {k — 1) |[25l. As we al- 
ready remarked, obtaining a constant factor approximation for SuB-MP without a mathematical programming 
relaxation Uke SubMP-Rel is difficult given the lack of combinatorial algorithms for special cases Uke NODE- 
WT-MC. 

Submodular functions play a fundamental role in classical combinatorial optimization. In recent years there 
have been several new results on approximation algorithms for problems with objective functions that depend on 
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submodular functions. In addition to combinatorial techniques such as greedy and local-search, mathematical 
programming methods have been particularly important. It is natural to use the Lovasz extension for problems 
involving minimization since the extension is convex; see |[T4l [121 U for instance. For maximization problems 
involving submodular functions the multilinear extension introduced in ||2l has been useful ll22l[T6l[T7ll23l l6l. 

2 Symmetric Submodular Multiway Partition 

We consider the following algorithm to round a feasible solution x to SubMP-Rel. 



SymSubMP-Rounding 

let X be a feasible solution to SubMP-Rel 
pick £ [0,1] uniformly at random 
for z = 1 to 

A{i,e) ^ {v I x{v,i) > 0} 

U{6) ^v-A{e) 

for z = 1 to 

{{uncross A'^,--- ,A'J) 

while there exist i ^ j such that A'^ O A'j ^ (d 

if(/(^o+/(^;--^o</(^D+/(^;)) 

a;. ^ a;. - A^ 

else 

A'^ ^A[- a;. 

return K,-- ■ Ut/(g)) 

We prove the following theorem. 

Theorem 2.1. Letx be a feasible solution to SubMP-Rel. If f is a symmetric submodular function, the algo- 
rithm SymSubMP-RouNDING outputs a valid multiway partition of expected cost at most 1.5 • f{xi). 

The algorithm does 6'-rounding in the interval [0, 1] to obtain (random) sets A{i, 6) fox i = 1, . . . ,k. Let 
OPTpRAc = Eti/(xi). Note that E[/(A(i,0))] = /(x^) and hence Eti lE[/(^(i, 0))] = Eti /(xi) = 
OPTpRAc- The lemma below shows that the uncrossing operation does not increase the cost. This is was used 
in the context of multiway cuts previously ET\ IH ; we include the proof for completeness. 

Lemma 2.2 (H). Let A'^, . . . , A'j, denote the sets after uncrossing the sets A{1, 9), . . . , A{k, 9). If f is a 
symmetric submodular function then U^^^yl- = U^^jj4(i, 9) and 

k k 
Y,f{A^,)<Y.f{A{i,9)). 

i=l i=l 

Proof: In each uncrossing step we replace A[ and A'.j either by A'^ and A'^ — A[ or by A'^ — A'j and A'j. Since 
/ is submodular and symmetric, / is posi-modular; that is, for any two sets X and Y, f{X) + f{Y) > f{X — 
Y) + f{Y - X). Therefore, for any two sets X and Y, min{/(X -Y) + f{Y), f{X) + f{Y - X)} is at most 
f{X) + f{Y). Thus it follows by induction that ^f^^ f{A'-) < ^^^^ f{A{i, 9)) and uf^^A^ = U^=iA{i, 9). 

□ 

Lemma 2.3. If f is a symmetric submodular function, 

E [/(C/(^))]<i0PTp,,c. 
0,1 z 
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Proof: By setting 5 = 1 in Theorem 1 1.51 we get 

0PTpRAc> C f{v -u{e))de+ C f{u{e))d9. 

Jo Jo 
Since / is symmetric, f{V — U{6)) = f{U{9) for all 9 and hence, 

OPTpRAc > 2 [' f{U{9))d9 = 2 E [f{U{9))]. 
Jo ee[o,i] 

□ 

The random partition returned by the algorithm is {A[, . . . ,A'f,_^,A'i^ U U{9)). A non-negative submodular 
function is sub-additive, hence f{A'j^ U U{9)) < f{A'i^) + f{U {9)). The expected cost of the partition is 

fe-l k 

E[/(^^)] + E[f{A', U U{9))] < nfiK)] + nfiU{9))] 

i=l 1=1 

k 

< Y 9))] + E[f{U{9))] (Using LemmallH) 

i=l 

< OPTpRAc + ^OPTpRAc (Using LemmaUll) 
= 1.50PTpRAc. 

This finishes the proof of Theorem 12.11 It is not hard to verify that the algorithm runs in polynomial time. One 
can easily derandomize the algorithm as follows. The only randomness is in the choice of 9. As 9 ranges in the 
interval [0, 1], the collection of sets {A{i,9) \ I < i < k} changes only when 9 crosses some x{vj,i) value. 
Thus there are at most nk such distinct values. We can try each of them as a choice for 9 and pick the least cost 
partition obtained among all the choices. 

Achieving a (1.5 — l//i;)-aproximation: We can improve the approximation to 1.5 — as follows. We relabel 
the terminals so that k = arg maX]^<j<;j /(xi). We perform ^-rounding with respect to the first k — 1 terminals 
in order to get the sets A{i, 9) for each i ^ k, and we let U{9) = V — Ui<i<A:_iA(f , 9). We uncross the sets 
{A{i, ^) I 1 < i < A;} to get k—1 disjoint sets A'^, and we return (^4'^^, • • • , A'j^_i, U {9)). We can prove a variant 
of Theorem 1 1.5 1 that shows that the expected cost of U{9) is at most OPTprac/2, even when U{9) is the set 
of all vertices that are unallocated when we perform 6'-rounding with respect to only the first k' terminals, for 
any k' < k. The proof of this extension of Theorem II. 5l is notationally and technically messy (and somewhat 
non-trivial), and we omit it in this version of the paper. The total expected cost of the sets A[, - ■ ■ , A'^^^i is at 
most (1 — l/A;)OPTpRAc (since we saved on /(xfc)), and the expected cost of U (9) is at most OPTprac/2. 



3 Submodular Multiway Partition 

In this section we consider SUB-MP when / is an arbitrary non-negative submodular function. We choose 
9 G (1/2, 1] to ensure that the sets {A{i, 9) \ 1 < i < k} aie disjoint. 

SubMP-Half-Rounding 

let X be a feasible solution to SubMP-Rel 
pick 6* e (1/2, 1] uniformly at random 
for i = 1 to fc 

A{i,e) ^ {v I x{v,i) > 0} 
A{0)^[ji<.^kM^,O) 

U{9) ^v- A{e) 

return (A(l, 0), ■ ■ ■ , A{k - 1, 9),A{k, 0) U U{0)) 
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Proof of Theorem ll.2t In the following, we will show that SubMP-Half-Rounding achieves a 2-approximation 
for Sub -MP. As before, let OPTfrac = Ei=i /(xi)- 

Since / is subadditive, the expected cost of the partition 

returned by SubMP-Half-Rounding is 



E 

9e(i/2,i] 



fe-i 

E 

i=l 



fiAii,9)) + f{Aik,9)uUi9)) 



< E 
ee(i/2,i] 




fiA{i,9)) + fiU{9)) 



f{Aii,9))d9+ / f{Ui9))d9 



1/2 



E 



1/2 



f{Aii,9))d9+ f{U{9))d9 



1/2 



To show that the expected cost is at most 20PTprac it suffices to show that J2i=i lo^'^ fi^ih^^))'^^ — 
Ii/2 f{U{9))d9. Setting (5 = 1/2 in TheoremOwe get 



E 



1/2 



f{Aii,9))d9> 



1/2 



> 



1/2 



f{V-Ui9))d9+ / fiUi9))d9 
Jo 

f{U{9))d9 if is non-negative) 



Thus SubMP-Half-Rounding achieves a randomized 2-approximation for SUB-MP. The algorithm can be 
derandomized in the same fashion as the one for symmetric functions since there are at most nk values of 9 
(the x{vj,i) values) where the partition returned by the algorithm can change. □ 

Improving the factor of 2: As we remarked earUer the Vertex Cover problem can be reduced in an ap- 
proximation preserving fashion to SUB-MP, and hence it is unlikely that the factor of 2 for SUB-MP can be 
improved. However, it may be possible to obtain a 2(1 — l/fc)-approximation. A natural algorithm here is 
to do half-rounding only with respect to the first k — 1 terminals, where k = argmax^ /(xj), and assign all 
the remaining elements to k. We have so far been unable to strengthen Theorem 11.51 to achieve the desired 
improvement. 



4 Proof of Main Theorem 

In this section we prove Theorem 11.51 our main technical result. We recall some relevant definitions. Let x be 
a solution to SubMP-Rel. We are interested in analyzing 6'-rounding when 9 is chosen uniformly at random 
from an interval [1 — 6, 1] for some 6 > 0. For a label i let A{i, 9) = {v £ V \ x{v, i) > 9} be the set of all 
vertices that are assigned/allocated to i for some fixed 9. Note that for distinct labels i, i' the sets A{i, 9) and 
A{i', 9) may not be disjoint if 6* < 1/2, although they are disjoint if 6* > 1/2. Let A{9) = Ui<j<A; ^) be 
the set of all vertices that are allocated to the temiinals when 9 is the chosen threshold. We let U{9) = V — A{9) 
denote the set of unallocated vertices. With this notation in place we restate Theorem 1 1.5 1 

Theorem 4.1. For any 6 € [1/2, 1], we have 

V [' f{A{i,9)d9>{k5- 6 -!)/{$)+ [' f{A{9))d9+ [' f{U{9))d9. 
j^]^ Jo Jo Jo 

The proof of the above theorem is somewhat long and technical. At a high-level it is based on induction 
on the number of vertices with a particular ordering that we discuss now. In the following, we use i to index 
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over the labels, and we use j to index over the vertices. For vertex Vj, let aj = maxj x{vj,i) be the maximum 
amount to which x assigns Vj to a label. We relabel the vertices such that < ai < a2 • • • < On < 1- For 
notational convenience we let uq = and a„+i = 1. Further, for each vertex Vj we let £j be a label such that 
Oj = x{vj,ij); note that £j is not necessarily unique unless aj > 1/20 

We observe that in 0-rounding, vj is allocated to a terminal (that is Vj € A{6)) if and only if ^ < aj, 
otherwise vj € U (6) and thus it is unallocated. It follows from our ordering that U{6) = {vi,V2, • • • , "fj-i} iff 
9 E {oij-i, aj] and in this case A{9) = V — U{9) = {vj, . . . , Vn}- Thus, prefixes of the ordering given by the 
a values are the only interesting sets to consider when analyzing the rounding process from the point of view 
of allocated and unallocated vertices. To help with notation, for 1 < j < n we let Vj = {vi, V2, ■ ■ ■ ^Vj} and 
Vo = 0. The following proposition captures this discussion. 

Proposition 4.2. Let = and let j be any index such that 1 < j < n. For any 9 G (aj_i,Qj], A{9) = 
V - Vj-i and U{9) = Vj-i. 

It helps to rewrite the expected cost of f{A{9)) and f{U{9)) under 0-rounding in a more convenient form given 
below. 

Proposition 4.3. Let r G [0, 1], and let h be the largest value of j such that aj < r. We have 

r f{A{9))d9 = V a.ifiV - V;_i) - f{V - V,)) + rf{V - V^) 
Jo 

and 

r f{U{9))d9 = J2 o^jifiVj^i) - f{y,)) + rfiVh). 
Proof: Recall from Proposition 14 . 2 1 that A{9) = V — Vj-i when 9 € (oj-i, aj]. Therefore, 

f f{A{9))de = YT fiA{9))d9+ f f{A{9)d9 

h 

= J](a, - a,.i)f{V - Vj.i) + (r - ah)f{V - Vh) 

h 

= rf{V -Vh) + Y - ^J-i) - /(^ - ^j))- 

The second identity follows from a very similar argument. □ 



The inductive approach: Recall that numbering the vertices in increasing order of their a values ensures 
that U{9) is Vj for some < j < n. Let Xj be the restriction of x to Vj. Note that Xj gives a feasible allocation 
of Vj to the k labels although it does not necessarily con^espond to a multiway partition with respect to the 
original terminals. Also, note that the function / when restricted to Vj is still submodulai" but may not be 
symmetric even if / is. In order to argue about Xj we introduce additional notation. Let Aj{i, 9) = A{i, 9)r\Vj, 
Aj{9) = A{9) n Vj, and Uj{9) = U{9) n Vj. In other words Aj{9) and Uj{9) are the allocated and unallocated 
sets if we did 0-rounding with respect to Xj that is defined over Vj. 

*An alert reader may notice that we do not distinguish between terminals and non-terminals. In fact the theorem statement does not 
rely on the fact that terminals are assigned fully to their respective labels. The only place we use the fact that x{si, i) = 1 for each i is 
to show that 6'-rounding based algorithms produce a valid multiway partition with respect to the terminals. 
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Let pj = Yl!i=i lo 9))d9\ we have po = k5f{fl>). Note that the left hand side of the inequality 

in Theorem 11.51 is p„ = "^i^i Jq f{An{i,9))d6, since An{i,0) = A{i,9). To understand p„ we consider 
the quantity pj — which is easier since pj and pj^i differ only in Vj. Recall that £j is a label such that 
aj = max^^^ x{vj,i). The importance of £j is that if vj is allocated then it is allocated to £j (and possibly to 
other labels as well). We express pj — pj^i as the sum of two quantities with the term for ij separated out. 

Proposition 4.4. 

Pj-Pj~i = [\f{Ajii,,9)) - fiA,_,{i„e))d9 + y] [\fiA,ii,e)) - f{Aj^^{i,e))de. 

We prove the following two key lemmas by using of submodularity of / appropriately. 
Lemma 4.5. For any 6 € [1/2, 1] and for any j such that 1 < j < n, 

[\f{A,{i,e)) - f{A,.i{i,e))de > /{Vj) - + a,(/(y,_i) - f{v,)). 

Summing the left hand side in the above lemma over all j and applying Proposition 14. 3 1 with r = 1 we obtain: 
Corollary 4.6. 

ElE ^\fiAj{^,e))-f{AJ^l{^,9))de] > f f{U{d))de - f{%). 

Our second key lemma below is the more involved one. Unlike the first lemma above we do not have a clean 
and easy expression for a single term {f{Aj{£j, 9)) — f{Aj-i{£j, 9)) d9 but the sum over all j gives a nice 
telescoping sum that results in the bound below. 

Lemma 4.7. Let 5 G [0, 1] and let h be the largest value of j such that aj < 5. 

if{Aj{ij,9))-f{Aj.,{ij,9))d9j > f{Ai9))d9-5fm. 

The proofs of the above lemmas are given in Sections 14. 1 1 and |42] respectivelv. We now finish the proof of 
Theorem 11.51 assuming the above two lemmas. 

Proof of Theorem ll.St Let h be the largest value of j such that aj < 5. From Proposition 14.41 we have 

k „5 



V / f{A{i, 9))d9 = = po + - Pj-l) 

^=l j=l 

= Po + Y. \ / {f{Aj{ij,9))-f{Aj.^{ij,9))d9 + Y, / {f{Aj{i,9))-f{Aj^i{i,9))d9 
.7=1 Wo 



(Use Proposition 14.4 



r5 n f „S \ 

> po + y f{A{9))d9 - 5fm + E E y ^)) - ^)) (Use Lemmagj 

>Po+ f f{A{9))d9-5f{%)+ C f{U{9))d9 - f{%) (Use Corollary EI]) 
Jo Jo 



> (M-5-l)/(0)+ / f{A{9))d9+ f{U{9))d9. 
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We used pq = 5kf{$) in the final inequality. □ 
4.1 Proof of Lemma |45l 

Recall that the lemma states that for S G [1/2, 1] and for any j, 

J2 [\fiMi,e)) - fiA,.,ii,9))de > fiVj) - + a,(/(y,_i) - /(y,)). 

Proof of Lemma l45t Fix j and label i. We have 

{f{A,{t,e)) - f{Aj.i{i,e))de = / UiMiM - f{A,_iii,e))de 



since Aj{i,9) = Aj-i{i,9) when 9 is in the interval {m.m{S,x{vj,i)),6] (or the interval is empty). When 
9 < x{vj,i) we have Aj{i,9) = Aj-i{i,9) + Vj. Since / is submodular and Aj-i{i,9) C Vj-i, it follows 
that, for any 9 < x{vj,i), we have 

f{A,ii,9))-f{A,^i{i,9)) = f{Aj.^{i,9)+v,)-f{Aj^i{i,9)) > /(V,.„,+^;^.)-/(^i-i) = f{Vj)-f{V,^i). 
Therefore, 

5 /•min{S,x{vj 

if{Aj{i,9)) - f{Aj^^{i,9))d9 = / {f{Aj{i,9)) - f{A,^,{i,9))d9 

Jo 

> / if{V,)-fiV,.i))d9 
Jo 

= mm{6,x{vj,i)){f{Vj) - fiVj^i)). 

Note that, for any i / £j, x{vj,i) < 6: if aj < S, the claim follows, since x{vj,i) < aj; otherwise, since 
6 > 1/2 and '^ix{vj,i) = 1, it follows that x{vj,i) < 6 for all i ^ Ij. Therefore, by using the previous 
bound, 

f<5 



^ / (/(A,(i,0)-/(A,_i(z,^))d0>^min(5,x(7;,,i))(/(y,)-/(y,_i)) 

= 5^x(^„i)(/(y,)-/(v,_i)) 

= ii-xivj,i,)){fiyj)-fiyj~i)) 

= fiVj) - fiVj^i) + ajifiVj^i) - f{Vj)). 



□ 



4.2 Proof of Lemma |47] 

We recall the statement of the lemma. Let 6 G [0, 1] and let h be the largest value of j such that aj < 6. Then 

if{Aj{ej,9))-f{Aj.,iij,9))d9j > f{Ai9))d9-5fm. 
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Our goal is to obtain a suitable expression that is upper bounded by the quantity {f{Aj{£j, 6))—f{Aj-i{£j, 0))d6. 
It turns out that this expression has several terms and when we sum over all j they telescope to give us the de- 
sired bound. 

We begin by simplifying jQ{f{Aj{ij,6)) — f{Aj^i{£j, 9))d6 by applying submodularity. The following 
proposition follows from the fact that / is submodular and Aj{ij, 6) C Aj{6). 

Proposition 4.8. For any j such that 1 < j < n, 

S />min(oj,(5) 

{f{Aj{i,,e)) - f{A,.i{e,,e)))de > / {f{A,^i{e) + v,) - f{A,^r{e)))de. 

Jo 

Proof: If 61 G [0, min((5, q^)] wehave A 9) = Aj_i{ij,e) +Vj.lf9 £ {mm{6,aj),6] then Aj{ij, 9) = 
Aj-i{ij,9). Therefore 

S /•min(<5,aj) 

{f{Aj{£j,9)) - f{A,.i{e„9))) d9 = / {f{Aj{£„9)) - f{A,_i{lj,9))) d9 

Jo 

min{S,aj) 

{f{A,.i{e,,9) + Vj)-f{A,_i{ej,9)))d9 

Since / is submodular and Aj^i{ij, 9) C Aj^i{9), it follows that, for any 9 < aj, 

f{A,{ij,9)) - f{A,_,{£„9)) > f{A,_,i9) + Vj) - f{Aj.i{9)) 
and the proposition follows. □ 

Let A,- = j;^^{f{Aj^i{9) + Vj)-f{Aj^,{9)))d9,andA, = {f{Aj^,{9) + v,) - f{Aj^i{9))) d9. 
Note that the right hand side of the inequality in Proposition 14.81 is equal to Aj if j < h, and it is equal to 
Aj — Aj otherwise. Proposition |4.9| and Proposition 14. 1 1 [ express Aj and Aj in a more convenient form. 

Let Vj/j = {vj',Vj'^i, • • • , Vj} for all / and j such that j' < j; let Vj'j = for all / and j such that 
/ > J- 

Proposition 4.9. 

n 

j'=i 

Proof: It follows Proposition |4!2]that, if 6* G (aj'_i, aj/], A{9) = Vj/^n and Aj{9) = Vj'j. Therefore 
A,= r\f{A^_,{9) + Vj)-f{A,^mde = j2 r {f{A,^i{9) + v,)-f{A,^i{9)))d9 

Jo y^^Ja^,_^ 

j'=i 

n 

j'=l 

The last line follows from the fact that, if j' > j, Vjtj = Vj'^j-i = 0- D 
The corollary below follows by simple algebraic manipulation and is moved to Appendix [A] 
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Corollary 4.10. 

n n 

We now consider Tj. 

Proposition 4.11. For all j > h where h is the largest index such that < 5, 

n 

A, = {au+i - 6){f{Vh+i,,) - f{VH,j-i)) + («/ - «i'-i)(/C^/,.) - 

j'=h+2 

Proof: For notational convenience let /3h = 6 and /3j = aj for all j > h. It follows Proposition 14.21 that, if 

9 e (/3j/_i,/3j/], A{e) = Vf^n and Aj{e) = Vfj. Therefore 



r\f{A,^i{9) + v,)-f{A,^,mde = ^ f' {f{A,^m + Vj)-f{Aj^m))de 

i 

j'=h+i 

n 

j'=h+l 

The last hne follows from the fact that, if j' > j, Vjij = l^'j-i = ^- The lemma follows by noting that 

n 

J2 - Pf-i)ifiyf,j) - f(.yj',j-i)) = - - 
j'=h+i 

n 

+ E («,'-«,'-i)(/(v;',,)-/(v;',,_i)). 

j'=h+2 

□ 

The corollary below follows by simple algebraic manipulation and is moved to Appendix lAl 
Corollary 4.12. 

n n 

^ A,= ^ a,{f{V-V,^i)-f{V-V,))-6{f{V-VH)-fm)- 

j=h+l j=h+l 

Now we finish the proof. 

Proof of Lemma I4.7t We apply Proposition 14.81 in the first inequality below, and then Corollary 14.101 and 
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Corollary 14.1 2 I to derive the third line from the second. 



ifiMij,9))-f{A,.,{£„9)))de > u{A,.m + v,)- f{A,.,{e)))de 

.-1 -^0 JO 



3=1"'' j=l 



n 

j=l j=h+l 
h 

J2 (^jifiV - - f{V - Vj)) + 5{f{V - Vh) - /(0)) 
fiA{e))d9-6fm. 



The last equality follows from Proposition 14.31 □ 

5 Conclusions and Open Problems 

The main open question is whether the integrality gap of SubMP-Rel for Sym-Sub-MP is stricly smaller 
than the bound of 1.5 — 1/A; we showed in this paper. Karger et al. |[T5l rely extensively on the geometry of the 
simplex to obtain a bound of 1.3438 for Graph-MC via the relaxation from ||3]- However, we mention that 
the rounding algorithms used in ||T5| have natural analogues for rounding SubMP-Rel but analyzing them is 
quite challenging for an arbitrary symmetric submodular function. 

Zhao, Nagamochi and Ibaraki |[25l considered a common generalization of SUB-MP and K-WAY-SUB-MP 
where we are given a set 5 of terminals with |5| > k and the goal is to partition V into k sets Ai, . . . ,Ak 
such that each Ai contains at least one terminal and Yl\=i fi^i) minimized. Note that when |5| = A; we 
get SuB-MP and when S = V we get K-WAY-SuB-MP. The advantage of the greedy splitting algorithms 
developed in |[25l is that they extend to these more general problems. However, unlike the case of SuB-MP, 
there does not appear to be an easy way to write a relaxation for this more general problem; in the special case 
of graphs such a relaxation has been developed 0. An important open problem here is whether the A;- way cut 
problem in graphs admits an approximation better than 2(1 — l/k). 

Related to the above questions is the complexity of K-WAY-SUB-MP when is a fixed constant. For 
S YM-SUB-MP a polynomial-time algorithm was claimed in ll20l although no formal proof has been published; 
this generalizes the polynomial-time algorithm for graph fc-cut problem first developed by Goldschmidt and 
Hochbaum |fT3|. There has been particular interest in the special case of K-WAY-SUB-MP, namely, the hyper- 
graph A;-cut problem; a polynomial time algorithm for = 3 was developed in |[24ll and extended to SuB-MP in 
|[T9l . Fukunaga Q gave polynomial time algorithms when k and the maximum hyperedge size are both fixed. 
The following is an an open problem. Does the hypergraph A:-cut problem for k = 4 have a polynomial time 
algorithm? 
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A Omitted proofs from Section 143 



Proof of Corollary |4.10t It follows from Proposition |4.9| that 

n n n 

j=i j'=i j=i 

n 

= ^{aj, - a,,.,){f{V,,,n) - f{Vr,o)) 

n 

= ^(a,-,-«,,_i)(/(^,>)-/(0)) 



n 



n-1 



C^AfiYr.n) - /(0)) - E «i'(/(^,'+l,n) - /(0)) 
j'=l j'=0 
n 

ai'(/(^,',n) - f{Vf+l,n)) + «n(/(K+l,n) " /(0)) 

j'=l 
n 

j'=l 
n 

E«i'(/(^-v;'_i)-/(y-y,0) 
j'=i 



n 



Proof of Corollary |4.12t For notational convenience, let f3h = ^ and /3j = aj for all j > h. It follows from 
Proposition 14.1 ll that 

n n n 

Y = E - E (/(^.',.) - 

j=h+l j'=h+l j=h+l 

n 

= E - /3,'-l)(/(^,',n) - 
f=h+l 



E (/?,'- /3,'-i)(/(F,>)-/(0)) 

71 n—1 

E - /(0)) - E /3/(/(^.'+l,n) - /(0)) 

j'=h+l j'=h 
n 

E - fiVj'+l,n)) - Ph{f{Vh+l,n) - /(0)) +/3n(/(K+l,n) " /(0)) 

j'=h+l 
n 

Y »Afiy - yf-i) - fiv - v^)) - 6{f{v - - /(0)) 

j'=h+i 



□ 
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